ABSTRACT

The paper considers stochastically constrained nonlinear programming problems. A penalty-type method is suggested as a deterministic surrogate. The penalty is constructed in terms of a "distance" function between random variables, given in terms of the $\phi$-divergence functional (a generalization of the relative entropy). A duality theory is developed in which a general relation between $\phi$-divergence and utility functions is revealed, via the conjugate transform, and a new type of certainty equivalent concept emerges.

Key Words: Stochastic Programming, Penalty Functions, $\phi$-divergence, Entropy, Conjugate Duality, Utility Functions, Certainty Equivalent.

1. Introduction

In this paper, we consider mathematical programming problems with stochastic constraints of the form:

$$(\mathrm{SP}) \qquad \inf\{g_0(x) : g(x,b) \le 0\}$$

where $x \in \mathbb{R}^n$ is the decision vector, $g_0: \mathbb{R}^n \to \mathbb{R}$ and $g: \mathbb{R}^n \times \mathbb{R}^k \to \mathbb{R}^m$ are given functions, and $b \in \mathbb{R}^k$ is a random vector. A new penalty-type decision theoretic approach to problem (SP) was introduced recently by Ben-Tal [2]. In this approach, the stochastic program (SP) is replaced by an unconstrained deterministic program:

$$(\mathrm{DP}) \qquad \inf\{g_0(x) + p\,P_E(x)\}$$

where $p > 0$ is a penalty parameter and $P_E$ is a penalty function for violation of the constraints in the mean, i.e. $P_E(x) = 0$ if $E\,g(x,b) \le 0$(*) and $P_E(x) > 0$ otherwise.

(*) $E$ denotes the expectation operator with respect to the random vector $b$.

The special feature of this approach is the choice of the penalty function $P_E$, which is constructed in terms of the so-called Kullback-Leibler relative entropy functional (or divergence), widely used in statistical information theory [9], [10].

If $D_k$ is the set of all generalized densities $f$ of random vectors $z \in \mathbb{R}^k$ with support $T$ (all absolutely continuous with respect to a common non-negative measure $dt$), and $f_b$ is a given density in $D_k$ of the random vector $b \in \mathbb{R}^k$, then the relative entropy between the random vectors $z$ (with density $f \in D_k$) and $b$ (with density $f_b$) is

$$I(f, f_b) = \int_T f(t) \log\frac{f(t)}{f_b(t)}\,dt, \qquad f \in D_k.$$

The penalty function $P_E$ is defined in [2] as the following infinite dimensional optimization problem:

$$P_E(x) = \inf_{f \in D_k} \left\{ \int_T f(t)\log\frac{f(t)}{f_b(t)}\,dt \ : \ \int_T g(x,t) f(t)\,dt \le 0 \right\} \tag{1.1}$$

and is accordingly called the entropic penalty.

Many attractive properties of the entropic penalty and of the induced deterministic program (DP) are obtained using a fundamental dual representation of $P_E$ derived in [2]:

$$P_E(x) = \sup_{y \ge 0} \left\{ -\log E\, e^{-y^t g(x,b)} \right\}.$$

In particular, the dual expression of $P_E$ is used to express the deterministic problem (DP) as a saddle function problem. For the important special case of problems with stochastic right hand side,

$$(\mathrm{SP\text{-}RHS}) \qquad \inf\{g_0(x) : g(x) \ge b\},$$

it is shown there that the primal entropic penalty program (DP) generates a dual problem which consists of maximizing the certainty equivalent of the classical Lagrangian dual function of (SP):

$$h_b(y) := \inf_x \{ L_b(x,y) = g_0(x) + y^t g(x,b) \},$$

i.e., the dual problem is:

$$\max_{y \ge 0}\ u^{-1} E\, u(h_b(y))$$

where $u$ is an exponential utility function and $u^{-1}$ is the inverse of $u$.
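
As a small numerical illustration of the dual representation of $P_E$ quoted above, the following Python sketch evaluates $\sup_{y \ge 0}\{-\log E\,e^{-y g}\}$ for a single scalar constraint on a finite sample space. The function names, the search bracket `y_max` and the use of ternary search (valid because the objective is concave in $y$) are illustrative assumptions and are not taken from [2]. The helper `tilted_kl` computes the relative entropy of the exponentially tilted density $f \propto f_b\,e^{-y g}$, which is the natural candidate for the primal minimizer in (1.1).

```python
import math

def entropic_penalty_dual(g_values, probs, y_max=50.0, iters=200):
    """Evaluate sup_{y >= 0} -log E exp(-y g) for a scalar constraint g
    taking the values g_values with probabilities probs.
    Returns (penalty value, maximizing y). Hypothetical helper."""
    def obj(y):
        return -math.log(sum(p * math.exp(-y * g) for g, p in zip(g_values, probs)))

    lo, hi = 0.0, y_max
    for _ in range(iters):  # ternary search on a concave objective
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if obj(m1) < obj(m2):
            lo = m1
        else:
            hi = m2
    y = 0.5 * (lo + hi)
    # y = 0 is always admissible and gives the value 0
    return max(obj(y), 0.0), y

def tilted_kl(g_values, probs, y):
    """Relative entropy I(f, f_b) of the tilted density f ~ f_b * exp(-y g)."""
    w = [p * math.exp(-y * g) for g, p in zip(g_values, probs)]
    z = sum(w)
    f = [wi / z for wi in w]
    return sum(fi * math.log(fi / p) for fi, p in zip(f, probs))

if __name__ == "__main__":
    # E g = 0.4 > 0: the constraint is violated in the mean, so the penalty is positive
    print(entropic_penalty_dual([1.0, -1.0], [0.7, 0.3]))
    # E g = -0.4 <= 0: the penalty vanishes
    print(entropic_penalty_dual([1.0, -1.0], [0.3, 0.7]))
```

In the violated case the dual value and the relative entropy of the tilted density agree up to numerical error, which is the primal-dual relation the text refers to.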

This interesting dual relationship between the minimization of the classical relative entropy functional and the maximization of expected utility gives rise to the following natural questions:

(1) Do such duality results hold for arbitrary utilities (not just exponential)?

(2)  Assuming  (1)  holds,  what  is  the  corresponding  entropy-type  functional 
involved  in  defining  an  appropriate  penalty  function? 

(3)  How  does  the  new  entropy-type  penalty  relate  to  utility  functions 
and  is  it  appropriate  to  treat  stochastic  programs  (SP)? 

In this paper, we aim at generalizing and unifying the results derived in [2], and at providing satisfactory answers to the above questions. The key to the generalization is the concept of $\phi$-divergence, $I_\phi$, introduced by Csiszár [7]. It includes most of the important entropy type functionals used in mathematical statistics. Its legitimacy as a measure of "distance" between probability distributions, as well as some of its basic properties needed in this paper, are discussed in Section 2. Adopting this concept here, a generalized penalty function is defined by replacing in (1.1) the classical divergence $I$ (based on the special choice $\phi(t) = t \log t$) with $I_\phi$.

In terms of the $\phi$-entropic penalty $P_\phi$, the stochastic program (SP) is replaced by a deterministic program:

$$(\mathrm{DP})_\phi \qquad \inf\{g_0(x) + P_\phi(x)\}.$$

The properties of $P_\phi$ and its appropriateness in treating the stochastic program (SP) by its deterministic surrogate $(\mathrm{DP})_\phi$ are discussed in Section 5.

A crucial step in studying these properties is the derivation of a simple dual representation of $P_\phi$, see Section 3. This representation also enables us to associate, in a natural way, the kernel information function $\phi$ with a utility function $u$, via the conjugate function $\phi^*$ of $\phi$. In terms of this utility function, we introduce in Section 4 a new type of certainty equivalent concept, possessing for arbitrary utilities many of the properties that the classical certainty equivalent possesses only for exponential utilities. A similar "new certainty equivalent" was first introduced by the authors in [3] from intuitive economic considerations.

In the last section, we treat stochastic right hand side problems, and generalize the results in [2] on the duality between the primal entropic penalty program (DP) and the problem of maximizing the classical certainty equivalent of the Lagrangian dual function $h_b(y)$. It is shown here that the dual problem associated with $(\mathrm{DP})_\phi$ consists of maximizing the new certainty equivalent of $h_b(y)$.

Finally, it is perhaps worthwhile to point out that many other problems which appear in a variety of applications (see [8], [21], [22]) fit the formalism of the $\phi$-entropy problem, and thus can benefit from the duality framework developed in Section 3. This will be discussed elsewhere in a future paper.

2. The $\phi$-Divergence and the Induced $\phi$-Entropic Penalty

In this section, we discuss some properties of the $\phi$-divergence, in terms of which the $\phi$-entropic penalty is constructed. Let $T$ be a locally compact Hausdorff space, $F$ the $\sigma$-field of Borel subsets of $T$, $dt$ a nonnegative regular Borel measure (rBm) on $T$, and $M(T)$ the linear space of real-valued finite rBm's on $T$.

Let $\mu_1$ and $\mu_2$ be two probability measures which are assumed absolutely continuous with respect to $dt$; we denote the density (Radon-Nikodym derivative) of $\mu_i$ by $f_i(t) = \frac{d\mu_i}{dt}$.

Assume here and henceforth in this paper that $\phi: \mathbb{R}_+ \to \mathbb{R}$ is a continuous proper convex function, so $\operatorname{dom}\phi := \{t : \phi(t) < \infty\} \ne \emptyset$ and $\phi(t) > -\infty$ $\forall t \in \mathbb{R}_+$. The class of such functions will be denoted by $\Phi$.

For a given function $\phi \in \Phi$, the $\phi$-divergence of the distributions $\mu_1$ and $\mu_2$ is defined in terms of their densities as:

$$I_\phi(f_1, f_2) = \int_T f_2(t)\,\phi\!\left(\frac{f_1(t)}{f_2(t)}\right) dt \tag{2.1}$$

The concept of $\phi$-divergence (or $\phi$-relative entropy) was introduced by Csiszár [6] as a generalization of many other entropy-type functionals widely used in statistical information theory (see, e.g., [4], [9], [15], [21]):


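Kernel function | The $\phi$-divergence | Name/Source
$\phi(t) = t\log t - t + 1$ | $I_\phi(f_1,f_2) = \int_T f_1(t)\log\frac{f_1(t)}{f_2(t)}\,dt$ | Kullback-Leibler [10]
$\phi(t) = \frac{1}{2}(1-t)^2$ | $I_\phi(f_1,f_2) = \int_T \frac{(f_1(t)-f_2(t))^2}{2 f_2(t)}\,dt$ | Kagan [9]
$\phi(t) = \frac{1}{\alpha-1}t^\alpha - \frac{\alpha}{\alpha-1}t + 1$, $\alpha > 0$, $\alpha \ne 1$ | $I_\phi(f_1,f_2) = \frac{1}{\alpha-1}\int_T f_1^\alpha(t)\,f_2^{1-\alpha}(t)\,dt + \text{const.}$ | $\alpha$-order divergence [15]
$\phi(t) = (1-\sqrt{t})^2$ | $I_\phi(f_1,f_2) = \int_T \left(\sqrt{f_1(t)}-\sqrt{f_2(t)}\right)^2 dt$ | Hellinger distance [4]
$\phi(t) = |1-t|$ | $I_\phi(f_1,f_2) = \int_T |f_1(t)-f_2(t)|\,dt$ | Variation distance [21]

TABLE 2.1. Examples of $\phi$-divergence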


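We assume here and henceforth that $\phi(1) = 0$ and $\lim_{t \to 0^+}\phi(t) = \phi(0)$, with the conventions $0\,\phi(0/0) = 0$ and $0\,\phi(a/0) = \lim_{\epsilon \to 0}\epsilon\,\phi(a/\epsilon) = a \lim_{t \to \infty}\phi(t)/t$ for $a \in (0, +\infty)$. All the examples in Table 2.1 satisfy these requirements.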

The following result follows directly from [7, Lemma 1.1]; it explains why $I_\phi$ can be used as a measure of distance between two random variables.

Proposition 2.1 The $\phi$-divergence functional (2.1) is well defined and non-negative. It is equal to zero if and only if $f_1 = f_2$ (a.e.).

The $\phi$-divergence also possesses an important convexity property:

Proposition 2.2 $I_\phi$ is convex in each of its arguments.


Proof: The convexity of $\phi$ is equivalent (for $t > 0$) to that of

$$\phi_0(t) := t\,\phi\!\left(\frac{1}{t}\right).$$

Moreover,

$$I_\phi(f_1,f_2) = I_{\phi_0}(f_2,f_1) = \int_T f_1(t)\,\phi_0\!\left(\frac{f_2(t)}{f_1(t)}\right) dt. \tag{2.2}$$

Now, the convexity of $I_\phi$ in $f_1$ is obvious, while its convexity in $f_2$ follows from (2.2).
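
The definition (2.1), Proposition 2.1 and the identity (2.2) can be checked numerically on discrete distributions. The sketch below is only an illustration: it replaces the integral over $T$ by a finite sum, assumes both densities are strictly positive, and the names `KERNELS`, `phi_divergence` and `dual_kernel` are hypothetical.

```python
import math

# Kernels from Table 2.1 (the alpha-order kernel is omitted for brevity)
KERNELS = {
    "kullback_leibler": lambda t: t * math.log(t) - t + 1 if t > 0 else 1.0,
    "chi_square": lambda t: 0.5 * (1 - t) ** 2,
    "hellinger": lambda t: (1 - math.sqrt(t)) ** 2,
    "variation": lambda t: abs(1 - t),
}

def phi_divergence(phi, f1, f2):
    """Discrete analogue of (2.1): sum_t f2(t) * phi(f1(t) / f2(t)).
    Assumes f2(t) > 0 for every t."""
    return sum(q * phi(p / q) for p, q in zip(f1, f2))

def dual_kernel(phi):
    """phi_0(t) = t * phi(1/t), the kernel used in identity (2.2)."""
    return lambda t: t * phi(1.0 / t)

if __name__ == "__main__":
    f1 = [0.2, 0.5, 0.3]
    f2 = [0.4, 0.4, 0.2]
    for name, phi in KERNELS.items():
        d12 = phi_divergence(phi, f1, f2)
        d_swapped = phi_divergence(dual_kernel(phi), f2, f1)  # should equal d12 by (2.2)
        print(name, d12, d_swapped, phi_divergence(phi, f1, f1))
```

For each kernel the divergence is positive for distinct distributions, zero for identical ones, and unchanged when the arguments are swapped together with the kernel, as (2.2) states.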


Adopting the concept of $\phi$-divergence and following the definition of the entropic penalty $P_E$ given in (1.1), we now define a generalized penalty function $P_\phi(\cdot)$, called the $\phi$-entropic penalty, as

$$P_\phi(x) = \inf_{f \in D_k}\left\{ \int_T p f_b(t)\,\phi\!\left(\frac{f(t)}{f_b(t)}\right) dt \ : \ \int_T g(x,t) f(t)\,dt \le 0 \right\}.$$

Observe that we have built into the definition of $P_\phi$ a penalty parameter $p > 0$. This parameter enables the decision-maker to control the size of the penalty so as to reflect his subjective attitude towards constraint violations. Note that, by choosing $\phi(t) = t \log t$,(*) one obtains $P_\phi = p \cdot P_E$, where $P_E$ is the usual entropic penalty (see eq. (1.1)). In terms of the $\phi$-entropic penalty, a surrogate for the stochastic primal (SP) will be the deterministic primal problem:

$$(\mathrm{DP})_\phi \qquad \inf_{x \in \mathbb{R}^n}\{g_0(x) + P_\phi(x)\}.$$

Properties of the $\phi$-entropic penalty and of the induced deterministic program $(\mathrm{DP})_\phi$ will be derived via the duality framework developed in the next section.

3. Duality Theory for the $\phi$-Entropic Penalty Problem

Let $X$ and $X^*$ be real vector spaces, and $\langle\cdot,\cdot\rangle$ a bilinear function defined on pairs $(x,x^*)$, $x \in X$, $x^* \in X^*$. Let $X$ and $X^*$ be equipped with locally convex Hausdorff topologies, compatible with the bilinear form, so that every element of one space can be identified with a continuous linear functional on the other. In this case $X$ and $X^*$ are called paired spaces and $\langle\cdot,\cdot\rangle$ is the pairing. For further details, see [4].

Now let $X$ and $Y$ be real vector spaces, $A: X \to Y$ a linear operator, $h: X \to \mathbb{R}$ a convex function with $\operatorname{dom} h = S$ and $g: Y \to \mathbb{R}$ a concave function with $\operatorname{dom} g = Q$. Consider the primal problem:

$$(\mathrm{A}) \qquad \inf\{h(x) - g(Ax) : x \in S,\ Ax \in Q\}.$$

The Fenchel-Rockafellar duality theory [20] associates with (A) the dual problem:

$$(\mathrm{B}) \qquad \sup\{g^*(x^*) - h^*(A^*x^*) : x^* \in Q^*,\ A^*x^* \in S^*\}$$

where $A^*: Y^* \to X^*$ is the adjoint of $A$, $X^*$ and $Y^*$ are the spaces paired with $X$ and $Y$ with the pairings $\langle\cdot,\cdot\rangle_x$ and $\langle\cdot,\cdot\rangle_y$ respectively, and $h^*$, $g^*$ are the usual convex and concave conjugates of $h$ and $g$, i.e.:

$$h^*(\cdot) = \sup_{x \in S}\{\langle x,\cdot\rangle_x - h(x)\}$$

$$g^*(\cdot) = \inf_{y \in Q}\{\langle y,\cdot\rangle_y - g(y)\}$$

Further, $S^* = \operatorname{dom} h^*$ and $Q^* = \operatorname{dom} g^*$.

(*) Footnote from Section 2, on the choice $\phi(t) = t\log t$: or $\phi(t) = t\log t - t + 1$ (as in Table 2.1), which is the normalized form, i.e. $\phi(1) = 0$.

The value of the $\phi$-entropic penalty at a given point $x$ is obtained as the solution of the infinite dimensional convex optimization problem:

$$(\mathrm{P}) \qquad \inf_{f \in D_k} \int_T p f_b(t)\,\phi\!\left(\frac{f(t)}{f_b(t)}\right) dt \qquad (p > 0)$$

subject to

$$\int_T g_i(x,t) f(t)\,dt \le 0, \qquad i = 1, \dots, m. \tag{3.1}$$

We set problem (P) in the format of the convex program (A) as follows. Consider the linear operator $B: M(T) \to \mathbb{R}^m$ given by

$$B\mu = \left( \int_T g_1(x,t)\,d\mu,\ \dots,\ \int_T g_m(x,t)\,d\mu \right)^t$$

and the integral functional

$$J(\mu) := \begin{cases} p\,I_\phi(f,f_b) = \int_T p f_b(t)\,\phi\!\left(\frac{f(t)}{f_b(t)}\right) dt & \text{if } \mu \text{ is an absolutely continuous rBm and } f = \frac{d\mu}{dt} \\ +\infty & \text{otherwise.} \end{cases}$$

Let $\Gamma$ be the linear function $\mu \mapsto \int_T d\mu$. Then problem (P) can be written as:

$$\inf\{J(\mu) : B\mu \le 0,\ \Gamma\mu = 1\}.$$


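By Proposition 2.2, $J$ is a convex functional, and it is easily seen that (P) corresponds to the convex program (A) with:

$$S = \operatorname{dom} J,\quad X := M(T),\quad Y = \mathbb{R}^m \times \mathbb{R}^2,\quad h(\cdot) := J(\cdot),\quad A := \begin{pmatrix} B \\ \Gamma \\ -\Gamma \end{pmatrix},\quad a := \begin{pmatrix} 0 \\ 1 \\ -1 \end{pmatrix},$$

$Q := \{z \in \mathbb{R}^m \times \mathbb{R}^2 : z \le a\}$ and $-g(\cdot) := \delta(\cdot \mid Q)$ (the indicator of $Q$). Elements $x^* \in Y^*$ are given as $x^* = (y, \eta_+, \eta_-)$.

Substituting these in the associated dual (B), we have: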


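Lemma 3.1. The dual problem of (P) is given by

$$\sup_{y \in \mathbb{R}^m_+}\ \sup_{\eta \in \mathbb{R}} \left\{ \eta - h^*(\eta - y^t g(x,t)) \right\}. \tag{3.2}$$

Proof: A simple computation shows that

$$g^*(x^*) = \begin{cases} \eta_+ - \eta_- & \text{if } x^* \le 0 \\ -\infty & \text{otherwise} \end{cases} \tag{3.3}$$

so $Q^* = \{x^* = (y,\eta_+,\eta_-) : x^* \le 0\}$. Further, we note that the adjoints of $A$ and $\Gamma$ are respectively:

$$A^*: \mathbb{R}^m \times \mathbb{R}^2 \to C(T): \quad A^*x^* = B^*y + \Gamma^*\eta_+ - \Gamma^*\eta_- = y^t g(x,t) + \Gamma^*\eta_+ - \Gamma^*\eta_- \tag{3.4}$$

$$\Gamma^*: \mathbb{R} \to C(T): \quad \Gamma^* s = s \quad \text{(a constant function in } C(T)\text{)} \tag{3.5}$$

where $C(T)$ is the linear space of continuous functions on $T$, usually identified with the dual space of $M(T)$. Therefore, substituting (3.3), (3.4) and (3.5) in problem (B) with $\eta := \eta_+ - \eta_-$ and replacing $y$ by $-y$, we obtain the desired result.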


In problem (D), the dual objective function is expressed in terms of $h^*$, which here is the conjugate of the integral functional $J(\cdot)$. The conjugate of $J$ is computed in the following result.

Lemma 3.2. Let $\phi \in \Phi$. The conjugate $J^*: C(T) \to \mathbb{R}$ of $J$ is given by:

$$J^*(f) = \int_T p f_b(t)\,\phi^*\!\left(\frac{f(t)}{p}\right) dt. \tag{3.6}$$


Proof: Let $C(T)$ be the space of all continuous functions $x: T \to \mathbb{R}$ with the norm

$$\|x\| = \max_{t \in T} |x(t)|.$$

Consider the integral functional $I: C(T) \to \mathbb{R}$ given by:

$$I(x) := \int_T p f_b(t)\,\phi^*\!\left(\frac{x(t)}{p}\right) dt = \int_T F(t,x)\,dt.$$

The conjugate of $I$ is then by definition:

$$I^*(\mu) = \sup\left\{ \int_T x\,d\mu - \int_T F(t,x)\,dt \ : \ x \in C(T) \right\}.$$

Using general results on the computation of $I^*$ (see e.g. [18], Theorem 4 and Corollary 4.A, pp. 452-454), it is easy to verify that for a given $\phi \in \Phi$ and $f_b \in D_k$, $F(t,x) = p f_b(t)\,\phi^*(x/p)$ satisfies the assumptions required there, and so we have

$$I^*(\mu) = \begin{cases} \int_T F^*\!\left(t, \frac{d\mu}{dt}\right) dt & \text{if } \mu \in M(T) \text{ is abs. continuous } (dt) \\ +\infty & \text{otherwise} \end{cases} \tag{3.7}$$

where $F^*(t,x^*)$ is the conjugate of $F(t,\cdot)$ at $x^*$ (for fixed $t$). Here

$$F^*(t,x^*) = \sup_x \left\{ x x^* - p f_b(t)\,\phi^*\!\left(\frac{x}{p}\right) \right\} = p f_b(t) \sup_x \left\{ \frac{x}{p}\cdot\frac{x^*}{f_b(t)} - \phi^*\!\left(\frac{x}{p}\right) \right\}. \tag{3.8}$$

Since $\phi \in \Phi$, hence continuous, we have $\phi = \phi^{**}$ and thus from (3.8)

$$F^*(t,x^*) = \begin{cases} p f_b(t)\,\phi\!\left(\frac{x^*}{f_b(t)}\right) & \text{if } x^* \ge 0 \\ +\infty & \text{otherwise.} \end{cases} \tag{3.9}$$

Setting $x^*(t) := f(t)$, and combining (3.7)-(3.9), we have obtained

$$I^*(\mu) = J(\mu).$$

Moreover, by the continuity and convexity of $I$, we have also:

$$J^*(f) = I^{**}(f) = I(f)$$

and $S^* = \operatorname{dom} J^* = C(T)$, and thus (3.6) is proved.

□


Combining the results in Lemmas 3.1 and 3.2, we have actually proven that the dual problem of (P) is given by:

$$(\mathrm{D}) \qquad \sup_{y \in \mathbb{R}^m_+}\ \sup_{\eta \in \mathbb{R}} \left\{ \eta - \int_T p f_b(t)\,\phi^*\!\left(\frac{\eta - y^t g(x,t)}{p}\right) dt \right\} \tag{3.10}$$

a finite dimensional concave program involving only nonnegativity constraints.

Duality results concerning the pair of problems (P)-(D) now follow.


Theorem 3.1. (a) If (P) is feasible, then inf(P) is attained and

$$\min(\mathrm{P}) = \sup(\mathrm{D}).$$

Moreover, if there exists a density $f \in D_k$ satisfying the constraints (3.1) strictly, then sup(D) is attained and

$$\min(\mathrm{P}) = \max(\mathrm{D}).$$

(b) Under the additional assumption $\lim_{t \to -\infty}\phi^*(t) < \infty$:

$$\sup(\mathrm{D}) < \infty \ \text{ if and only if (P) is feasible.}$$

Proof: (a) The result follows immediately from Rockafellar [20] (Theorems 3, 4, pp. 178-179). Indeed, since the dual (D) given in (3.10) has only non-negativity constraints $y \ge 0$, it satisfies the strongest constraint qualification, implying that (D) is stably set; hence the first part of conclusion (a) follows. Also, since $f \in D_k$ satisfies the constraints (3.1) strictly (i.e., the familiar Slater regularity condition), (P) is stably set and thus the second part of (a) is proved.

(b) The implication (P) feasible $\Rightarrow$ sup(D) $< \infty$ follows trivially from weak duality (without any assumption on the problem (P)). We now prove the reverse implication:

$$(\mathrm{P}) \text{ infeasible} \ \Rightarrow\ \sup(\mathrm{D}) = \infty.$$

The feasible set of (P) is

$$\{\mu : B\mu \le 0,\ \Gamma\mu = 1,\ \mu \text{ nonnegative}\}. \tag{3.11}$$

Using a duality theorem for linear programs in vector spaces (see e.g. [13], Theorem 3.13.8, p. 68), it follows that (3.11) is infeasible if and only if the system

$$-B^*y + \Gamma^*\eta < 0, \quad \eta > 0, \quad y \in \mathbb{R}^m_+ \tag{3.12}$$

is feasible ($B^*$, $\Gamma^*$ are as defined in (3.4) and (3.5) respectively). Thus the feasibility of (3.12) implies that:

$$\exists\, \bar y \ge 0,\ \bar\eta > 0: \quad \bar\eta - \bar y^t g(x,t) < 0 \quad \forall t \in T. \tag{3.13}$$

By taking $(\bar y, \bar\eta) \in \mathbb{R}^m_+ \times \mathbb{R}_+$ from (3.13), and choosing $y = M\bar y$, $\eta = M\bar\eta$ with $M > 0$, the dual (D) (see eq. (3.10)) yields:

$$\sup(\mathrm{D}) \ge \sup_{M > 0} \left\{ M\bar\eta - \int_T p f_b(t)\,\phi^*\!\left(\frac{M(\bar\eta - \bar y^t g(x,t))}{p}\right) dt \right\} \tag{3.14}$$

and since $\lim_{t \to -\infty}\phi^*(t) < \infty$, the sup in (3.14) can be made arbitrarily large.


4. A Representation of the $\phi$-Penalty in Terms of a New Certainty Equivalent

Throughout the rest of this paper we deal with the class of functions $\phi \in \Phi$ which are strictly convex, essentially smooth(*) (see [17], Section 26), and with $\phi'(1) = 0$. We denote this class by $\Phi_1$.

Recall that a dual representation of the $\phi$-entropic penalty is given by (3.10) as:

$$P_\phi(x) = \sup_{y \ge 0}\ \sup_{\eta \in \mathbb{R}} \left\{ \eta - \int_T p f_b(t)\,\phi^*\!\left(\frac{\eta - y^t g(x,t)}{p}\right) dt \right\}. \tag{4.1}$$

Let us introduce the utility function $u$ as:

$$u(t) := -\phi^*(-t). \tag{4.2}$$

Then $u$ is a strictly concave essentially smooth function (see [17], Theorem 26.3) with $u(0) = 0$, $u'(0) = 1$ (this is implied by $\phi(1) = 0$, $\phi'(1) = 0$). Table 4.1 gives the utility functions corresponding to the kernels given in Table 2.1.

(*) All functions given in Examples 1-4 from Table 2.1 are indeed essentially smooth.


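Kernel $\phi(x)$ | Utility $u(t) = -\phi^*(-t)$
$x\log x - x + 1$ | $1 - e^{-t}$
$\frac{1}{2}(1-x)^2$ | $t - \frac{t^2}{2}$
$(1-\sqrt{x})^2$ | $\frac{t}{1+t}$
$\frac{1}{\alpha-1}x^\alpha - \frac{\alpha}{\alpha-1}x + 1$ | $1 - \left(1 - \frac{\alpha-1}{\alpha}\,t\right)^{\alpha/(\alpha-1)}$

Table 4.1: The utility functions corresponding to the information-kernel functions.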
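
The correspondence (4.2) between a kernel $\phi$ and its utility $u$ can be checked numerically. The sketch below approximates the conjugate $\phi^*(s) = \sup_{t \ge 0}\{st - \phi(t)\}$ by a ternary search over a bounded interval, which is an illustrative assumption (the bound `t_max` and all function names are hypothetical), and compares $u(t) = -\phi^*(-t)$ with the closed forms $1 - e^{-t}$, $t - t^2/2$ and $t/(1+t)$ at a sample point.

```python
import math

def kl_kernel(t):
    return t * math.log(t) - t + 1 if t > 0 else 1.0

def chi_square_kernel(t):
    return 0.5 * (1 - t) ** 2

def hellinger_kernel(t):
    return (1 - math.sqrt(t)) ** 2

def conjugate(phi, s, t_max=50.0, iters=200):
    """Approximate phi*(s) = sup_{0 <= t <= t_max} {s*t - phi(t)}.
    The objective is concave in t, so ternary search applies."""
    lo, hi = 0.0, t_max
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if s * m1 - phi(m1) < s * m2 - phi(m2):
            lo = m1
        else:
            hi = m2
    t = 0.5 * (lo + hi)
    return s * t - phi(t)

def utility(phi, t):
    """u(t) = -phi*(-t), as in (4.2)."""
    return -conjugate(phi, -t)

if __name__ == "__main__":
    t = 0.5
    print(utility(kl_kernel, t), 1 - math.exp(-t))
    print(utility(chi_square_kernel, t), t - t * t / 2)
    print(utility(hellinger_kernel, t), t / (1 + t))
```

The bounded search interval is adequate here because, for the sample point used, the maximizing $t$ lies well inside $[0, 50]$; other arguments may need a different bracket.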


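In terms of $u$, (4.1) can be written as:

$$P_\phi(x) = \sup_{y \ge 0}\ \sup_{\eta \in \mathbb{R}} \left\{ \eta + p\,E\,u\!\left(\frac{y^t g(x,b) - \eta}{p}\right) \right\}. \tag{4.3}$$

A little algebra shows that (4.3) can also be written as:

$$P_\phi(x) = \sup_{y \ge 0}\ p \cdot \sup_{\eta \in \mathbb{R}} \left\{ \eta + E\,u\!\left(\frac{y^t g(x,b)}{p} - \eta\right) \right\}. \tag{4.4}$$

For a random variable $X$, let us define the quantity:

$$S_p(X) := p \sup_{\eta \in \mathbb{R}} \left\{ \eta + E\,u\!\left(\frac{X}{p} - \eta\right) \right\}. \tag{4.5}$$

The latter was introduced by the authors in [3] (with $p = 1$), and termed the new certainty equivalent of $X$.

From (4.4) and (4.5), $P_\phi$ is given by:

$$P_\phi(x) = \sup_{y \ge 0} S_p(y^t g(x,b)), \tag{4.6}$$

so the properties of $P_\phi$ are directly related to those of the new certainty equivalent. The next result summarizes some basic properties of $S_p(X)$.
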
Lemma 4.1 Let $p > 0$ be fixed. For any random variable $X$ and a constant $w \in \mathbb{R}$:

(a) $S_p(w) = w$

(b) $S_p(X) \le E(X)$

(c) $S_p(X + w) = S_p(X) + w$.

Proof: (a) By definition, $S_p(w) = p \sup_{\eta \in \mathbb{R}}\{\eta + u(\frac{w}{p} - \eta)\}$. Equating the derivative of the supremand to zero we obtain $u'(\frac{w}{p} - \eta) = 1$; hence, since $u'(0) = 1$ and $u'$ is strictly decreasing (as a derivative of the strictly concave function $u$), the supremum is attained at $\eta = \frac{w}{p}$ and its value is then $S_p(w) = p \cdot \frac{w}{p} = w$.

(b) Since $u$ is strictly concave, $u(x) < x$ for all $x \ne 0$, hence

$$S_p(X) = p \sup_{\eta \in \mathbb{R}}\left\{\eta + E\,u\!\left(\frac{X}{p} - \eta\right)\right\} \le p \sup_{\eta \in \mathbb{R}}\left\{\eta + E\!\left(\frac{X}{p} - \eta\right)\right\} = E(X).$$

(c) By definition:

$$S_p(X + w) = p \sup_{\eta \in \mathbb{R}}\left\{\eta + E\,u\!\left(\frac{X + w}{p} - \eta\right)\right\},$$

hence with $\bar\eta := \eta - \frac{w}{p}$, one obtains

$$S_p(X + w) = p \sup_{\bar\eta \in \mathbb{R}}\left\{\bar\eta + \frac{w}{p} + E\,u\!\left(\frac{X}{p} - \bar\eta\right)\right\} = w + S_p(X).$$

The additivity property given in Lemma 4.1(c) will be of fundamental importance in deriving the duality results of the next section. Note that property (b) in the lemma corresponds to risk aversion (concave utility).


Example 4.1 [Exponential utility]

Let $\phi(x) = x\log x - x + 1$, then $\phi \in \Phi_1$. Its conjugate is $\phi^*(t) = e^t - 1$, so by (4.2), the induced utility is $u(t) = 1 - e^{-t}$. The new certainty equivalent is then

$$S_p(X) = -p \log E\, e^{-X/p}.$$

Here the new certainty equivalent coincides with the classical certainty equivalent corresponding to the utility function $u(t) = 1 - e^{-t/p}$, i.e.,

$$S_p(X) = u^{-1} E\,u(X).$$

The parameter $p$ is exactly the reciprocal of the Arrow-Pratt risk aversion indicator $(-u''/u')$, see [14].

Example 4.2 [Quadratic utility]

Let $\phi(x) = \frac{1}{2}(x-1)^2$, then $u(t) = t - \frac{t^2}{2}$ and

$$S_p(X) = \mu - \frac{1}{2p}\sigma^2$$

where $\mu$ is the mean of $X$ and $\sigma^2$ the variance.
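
The definition (4.5) and the two closed forms in Examples 4.1 and 4.2 can be compared numerically. The sketch below is a minimal illustration for a discrete random variable: the function names, the search interval for $\eta$ and the ternary search (which relies on the concavity of the supremand) are assumptions made here for illustration, not part of the paper.

```python
import math

def u_exp(t):
    return 1 - math.exp(-t)      # utility induced by the Kullback-Leibler kernel

def u_quad(t):
    return t - 0.5 * t * t       # utility induced by the quadratic kernel

def new_certainty_equivalent(u, xs, probs, p=1.0, lo=-50.0, hi=50.0, iters=200):
    """S_p(X) = p * sup_eta {eta + E u(X/p - eta)} for a discrete X,
    with the supremum located by ternary search over [lo, hi]."""
    def obj(eta):
        return eta + sum(q * u(x / p - eta) for x, q in zip(xs, probs))

    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if obj(m1) < obj(m2):
            lo = m1
        else:
            hi = m2
    return p * obj(0.5 * (lo + hi))

if __name__ == "__main__":
    xs, probs, p = [0.0, 1.0, 3.0], [0.2, 0.5, 0.3], 2.0
    mean = sum(q * x for x, q in zip(xs, probs))
    var = sum(q * (x - mean) ** 2 for x, q in zip(xs, probs))
    # Example 4.1: compare with -p log E exp(-X/p)
    print(new_certainty_equivalent(u_exp, xs, probs, p),
          -p * math.log(sum(q * math.exp(-x / p) for x, q in zip(xs, probs))))
    # Example 4.2: compare with mean - variance / (2p)
    print(new_certainty_equivalent(u_quad, xs, probs, p), mean - var / (2 * p))
```

The same routine can be used to check the properties in Lemma 4.1, for instance by shifting every value of `xs` by a constant and observing that the result shifts by the same amount.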

Example 4.1 showed that, for exponential utilities, $1/p$ is exactly the classical Arrow-Pratt risk indicator. This role of $1/p$ as a measure of risk is further explored in the next two results, which incidentally provide a generalization of Theorems 1 and 2 derived in Bamberg and Spremann [1] (proved there for the case of exponential utilities only).

Lemma 4.3(a) below will also be of particular importance in deriving the duality results of Section 6.

Lemma 4.2 $\lim_{p \to +\infty} S_p(X) = E(X)$ [Risk Neutrality].

Proof: Let $\alpha := \frac{1}{p}$ and define

$$v(\alpha) := \sup_\eta \{\eta + E\,u(\alpha X - \eta)\}. \tag{4.7}$$

Equating the derivative of the supremand to zero we obtain:

$$E\,u'(\alpha X - \eta) = 1.$$

Since $u'$ is strictly decreasing, by the implicit function theorem we have

$$v(\alpha) = \eta(\alpha) + E\,u(\alpha X - \eta(\alpha)), \tag{4.8}$$

where $\eta(\alpha)$ is the unique solution of

$$E\,u'(\alpha X - \eta(\alpha)) = 1. \tag{4.9}$$

From (4.9), $u'(-\eta(0)) = 1$ and then $\eta(0) = 0$.

In terms of $v(\alpha)$, we have with $\alpha := \frac{1}{p}$

$$S_p(X) = \frac{v(\alpha)}{\alpha}. \tag{4.10}$$

Thus, $\lim_{p \to \infty} S_p(X) = \lim_{\alpha \to 0} \frac{v(\alpha)}{\alpha}$, and by L'Hôpital's rule we get:

$$\lim_{p \to \infty} S_p(X) = \lim_{\alpha \to 0} \frac{v'(\alpha)}{1} = \lim_{\alpha \to 0}\left[\eta'(\alpha) + E\big((X - \eta'(\alpha))\,u'(\alpha X - \eta(\alpha))\big)\right] = E(X).$$

□

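Lemma 4.3 Let $X$ be a random variable with density $f_X$ supported on $[X_L, X_R]$. Then:

(a) $\forall \epsilon > 0$: $\lim_{p \to 0} S_p(X) \le X_L + \epsilon$.

(b) If, in addition, $u$ is strictly increasing, we have

$$\lim_{p \to 0} S_p(X) = X_L \qquad \text{[Risk averse]}.$$

Proof: (a) A (one dimensional) special case of problem (P) is:

$$(\mathrm{P})_1 \qquad \inf_{f \in D_1}\left\{ p\,I_\phi(f, f_X) \ : \ \int_{X_L}^{X_R} g(t) f(t)\,dt \le 0 \right\}.$$

Consider now the problem $(\mathrm{P}_\epsilon)$ obtained from $(\mathrm{P})_1$ with $g(t) := t - X_L - \epsilon$, namely

$$(\mathrm{P}_\epsilon) \qquad \inf_{f \in D_1}\left\{ p\,I_\phi(f, f_X) \ : \ \int_{X_L}^{X_R} t f(t)\,dt \le X_L + \epsilon \right\}.$$

The dual of $(\mathrm{P}_\epsilon)$, obtained from (4.6), is:

$$\sup_{y \ge 0} S_p(y(X - X_L - \epsilon)),$$

and using the additivity of $S_p$ (Lemma 4.1(c)) and (4.7), it becomes:

$$(\mathrm{D}_\epsilon) \qquad \sup_{y \ge 0}\{S_p(yX) - y(X_L + \epsilon)\}.$$

Problem $(\mathrm{P}_\epsilon)$ is clearly feasible $\forall \epsilon > 0$, and so from weak duality between the pair $(\mathrm{P}_\epsilon)$-$(\mathrm{D}_\epsilon)$, we have:

$$(\mathrm{P}_\epsilon) \text{ feasible} \ \Rightarrow\ \sup(\mathrm{D}_\epsilon) < \infty. \tag{4.11}$$

Hence,

$$\infty > \sup(\mathrm{D}_\epsilon) \ge \lim_{y \to \infty} y\left[\frac{S_p(yX)}{y} - (X_L + \epsilon)\right],$$

i.e.,

$$\lim_{y \to \infty}\frac{S_p(yX)}{y} \le X_L + \epsilon. \tag{4.12}$$

Now, it is easily verified that $\frac{S_p(yX)}{y} = S_{p/y}(X)$, hence (a) follows from (4.12).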


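(b) For $\alpha > 0$ we have $\alpha X - \eta \ge \alpha X_L - \eta$, and since $u$ is assumed strictly increasing we have:

$$\eta + E\,u(\alpha X - \eta) \ge \eta + u(\alpha X_L - \eta).$$

Hence, with $\alpha = \frac{1}{p}$,

$$S_p(X) \ge p \cdot \sup_\eta\left\{\eta + u\!\left(\frac{X_L}{p} - \eta\right)\right\}.$$

The right-hand side is easily computed to be $X_L$ (it equals $S_p(X_L)$, see Lemma 4.1(a)), and so

$$S_p(X) \ge X_L.$$

This combined with (a) proves (b).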
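
The two limits in Lemmas 4.2 and 4.3 can be observed numerically for the exponential utility of Example 4.1, where $S_p(X) = -p \log E\,e^{-X/p}$ is available in closed form. The sketch below is illustrative only: it uses a discrete random variable (so $X_L$ is simply the smallest value), and the function name `s_exp` and the shift by $\min X$ used to avoid floating-point overflow for small $p$ are assumptions of this example.

```python
import math

def s_exp(xs, probs, p):
    """S_p(X) = -p log E exp(-X/p) for the exponential utility,
    rewritten as m - p log E exp(-(X - m)/p) with m = min X for stability."""
    m = min(xs)
    return m - p * math.log(sum(q * math.exp(-(x - m) / p) for x, q in zip(xs, probs)))

if __name__ == "__main__":
    xs, probs = [0.0, 1.0, 3.0], [0.2, 0.5, 0.3]
    mean = sum(q * x for x, q in zip(xs, probs))
    for p in (1e-3, 1e-1, 1.0, 1e2, 1e6):
        # values move from min(xs) (p -> 0) towards the mean (p -> infinity)
        print(p, s_exp(xs, probs, p))
    print("mean:", mean, "lower bound:", min(xs))
```

Small values of $p$ correspond to strong risk aversion (the certainty equivalent approaches the worst outcome), and large values to risk neutrality (it approaches the mean).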


5. Properties of $P_\phi$ and a Min-Max Representation of $(\mathrm{DP})_\phi$

In this section we derive the properties of $P_\phi$ and discuss its appropriateness in treating the stochastic program (SP), using $(\mathrm{DP})_\phi$.

Theorem 5.1 For any $\phi \in \Phi_1$, the $\phi$-entropic penalty function $P_\phi$ satisfies:

(i)
$$P_\phi(x) \begin{cases} = 0 & \text{if } E\,g(x,b) \le 0 \\ \text{positive} & \text{if } E\,g(x,b) \not\le 0 \end{cases}$$

(ii) Under the additional assumption (A1)(*): $\lim_{t \to -\infty}\phi^*(t) < \infty$,

$$P_\phi(x) = \infty \quad \text{if for some } i, \quad \underline{g}_i(x) := \inf_{b \in T} g_i(x,b) > 0.$$

(*) This assumption holds for Examples 1 (with $\alpha < 1$), 2 and 4 in Table 4.1.

Proof: (i) Let $Q(x,y) := S_p(y^t g(x,b))$, then

$$P_\phi(x) = \sup_{y \ge 0} Q(x,y) \ge Q(x,0) = S_p(0) = 0. \tag{5.1}$$

The latter equality comes from Lemma 4.1(a).

Now using Lemma 4.1(b) we have:

$$Q(x,y) \le y^t E\,g(x,b)$$

with equality only for $y = 0$, and so

$$P_\phi(x) = \sup_{y \ge 0} Q(x,y) \le \sup_{y \ge 0} y^t E\,g(x,b).$$

If $E\,g(x,b) \le 0$, the last inequality shows that $P_\phi(x) \le 0$, which together with (5.1) proves the first part of (i).

Assume now that for some $i \in [1,m]$,

$$E\,g_i(x,b) > 0. \tag{5.2}$$

Let $Q_i(x,y_i) := Q(x,0,\dots,y_i,\dots,0) = S_p(y_i g_i(x,b))$. Then we have $Q_i(x,0) = S_p(0) = 0$. Moreover,

$$S_p(y_i g_i(x,b)) = p \cdot \sup_{\eta \in \mathbb{R}}\left\{\eta + E\,u\!\left(\frac{y_i g_i(x,b)}{p} - \eta\right)\right\}. \tag{5.3}$$

Since $\phi \in \Phi_1$, the function $\psi_x(y_i,\eta) := E\,u'(\frac{y_i}{p} g_i(x,b) - \eta)$ is continuously differentiable on $\mathbb{R} \times \mathbb{R}$ and $\frac{\partial \psi_x}{\partial \eta}(y_i,\eta) = -E\,u''(\frac{y_i}{p} g_i(x,b) - \eta) > 0$.

Hence, by the Implicit Function Theorem, there exists a unique solution $\eta = \eta(y_i)$ to the equation $\psi_x(y_i,\eta(y_i)) = 1$, and $\eta(0) = 0$. By the definition of $S_p(y_i g_i(x,b))$ in (5.3) as an unconstrained concave optimization problem, an explicit expression for it is obtained by equating the derivative of the supremand to zero, and therefore:

$$Q_i(x,y_i) = S_p(y_i g_i(x,b)) = p\left(\eta(y_i) + E\,u\!\left(\frac{y_i g_i(x,b)}{p} - \eta(y_i)\right)\right) \tag{5.4}$$

where $\eta(y_i)$ is the unique solution of

$$\psi_x(y_i,\eta) = 1. \tag{5.5}$$

Now, an easy computation shows that:

$$\left.\frac{d}{dy_i} Q_i(x,y_i)\right|_{y_i = 0} = E\,g_i(x,b) \cdot u'(-\eta(0)),$$

but $\eta(0) = 0$ and $u'(0) = 1$, so we have under assumption (5.2):

$$\left.\frac{d}{dy_i} Q_i(x,y_i)\right|_{y_i = 0} = E\,g_i(x,b) > 0.$$

Therefore, there exists $\hat y_i > 0$ (close enough to zero) such that

$$Q_i(x,\hat y_i) > Q_i(x,0) = 0. \tag{5.6}$$

Noting that

$$P_\phi(x) = \sup_{0 \le y \in \mathbb{R}^m} Q(x,y) \ge \sup_{0 \le y_i \in \mathbb{R}} Q_i(x,y_i) \ge Q_i(x,\hat y_i) > 0,$$

this proves the second part of (i).

(ii) Let $\underline{g}_i(x) > 0$ for some $i$. The latter means that problem (P) (see (3.1)) is infeasible, hence under assumption (A1), invoking Theorem 3.1(b), this implies that $P_\phi(x) = \infty$.

□

The first part of the theorem demonstrates that $P_\phi(x)$ is a penalty function for violation of the constraints in the mean. The second part shows that $P_\phi$ has the desirable property of excluding solutions which are not feasible in (SP) for any realization of $b$, i.e., for $x$ which is infeasible with probability 1, $P_\phi(x) = \infty$.

Therefore, a suitable deterministic surrogate problem for (SP) is

$$(\mathrm{DP})_\phi \qquad \inf_{x \in \mathbb{R}^n}\{g_0(x) + P_\phi(x)\}.$$

From the additivity of the new certainty equivalent (Lemma 4.1(c)),

$$\inf_x\{g_0(x) + P_\phi(x)\} = \inf_x \sup_{y \ge 0} S_p(g_0(x) + y^t g(x,b)),$$

hence $(\mathrm{DP})_\phi$ can be written as a minimax problem:

$$(\mathrm{DP})_\phi \qquad \inf_x \sup_{y \ge 0} S_p(L_b(x,y)) \tag{5.7}$$

where $L_b(x,y) = g_0(x) + y^t g(x,b)$ is the classical Lagrangian corresponding to the original problem (SP). This result generalizes Theorem 2 of [2].

We close this section by giving an explicit expression of $P_\phi$ for the familiar chance constraints problem [6].

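Example 5.1: Consider the well known chance constrained program(*)

$$(\mathrm{CC}) \qquad \inf\{g_0(x) : \Pr\{g(x) \ge b\} \le \alpha\} \tag{5.8}$$

which is a special case of the program

$$\inf\{g_0(x) : E\,g(x,b) \le 0\} \tag{5.9}$$

by choosing:

$$g(x,b) = \begin{cases} 1-\alpha & \text{if } g(x) \ge b \\ -\alpha & \text{if } g(x) < b \end{cases} \qquad \alpha \in (0,1). \tag{5.10}$$

(*) For simplicity we will treat here only the case of a single constraint, i.e., $g: \mathbb{R}^n \to \mathbb{R}$.

Using the dual representation of $P_\phi$ given in (4.4) we have:

$$P_\phi(x) = \sup_{0 \le y \in \mathbb{R}}\ p \sup_{\eta \in \mathbb{R}}\left\{\eta + \int_{-\infty}^{\infty} f_b(t)\,u\!\left(\frac{y}{p}\,g(x,t) - \eta\right) dt\right\}. \tag{5.11}$$

Recalling from (5.10) that here:

$$g(x,t) = \begin{cases} 1-\alpha & \text{if } g(x) \ge t \\ -\alpha & \text{otherwise} \end{cases} \qquad \alpha \in (0,1),$$

we get from (5.11), in terms of the cumulative distribution function $F(\cdot)$ of $b$ (denoting $F := F(g(x))$):

$$P_\phi(x) = \sup_{y \ge 0}\ p \sup_{\eta \in \mathbb{R}}\left\{\eta + F\,u\!\left(\frac{y(1-\alpha)}{p} - \eta\right) + (1-F)\,u\!\left(-\frac{y\alpha}{p} - \eta\right)\right\}. \tag{5.12}$$

Let us define $\bar\eta := -\frac{y\alpha}{p} - \eta$, then (5.12) becomes

$$P_\phi(x) = p \sup_{\bar\eta \in \mathbb{R}}\left\{-\bar\eta + (1-F)\,u(\bar\eta) + \sup_{y \ge 0}\left\{-\frac{\alpha y}{p} + F\,u\!\left(\bar\eta + \frac{y}{p}\right)\right\}\right\}. \tag{5.13}$$

The inner supremum in (5.13) is computed first; by simple calculus, the maximizing $y$ is $y^*$ given by:

$$y^* = \begin{cases} p\left[(u')^{-1}\!\left(\frac{\alpha}{F}\right) - \bar\eta\right] & \text{if } \frac{\alpha}{F} < u'(\bar\eta) \\ 0 & \text{otherwise.} \end{cases}$$

(The existence of $(u')^{-1}$ is guaranteed, since $u'$ is the derivative of a strictly concave function.)
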
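Substituting $y^*$ in (5.13) yields:

\[ P_\phi(x) = \begin{cases} p \sup_{\bar\eta \in \mathbb{R}} \{ (1-F)\,u(\bar\eta) - \bar\eta(1-\alpha) \} + p\,\psi(F) & \text{if } u'(\bar\eta) > \alpha/F \\ 0 & \text{otherwise} \end{cases} \tag{5.14} \]

where

\[ \psi(t) := -t \Big\{ \frac{\alpha}{t}\,(u')^{-1}\Big(\frac{\alpha}{t}\Big) - u\Big( (u')^{-1}\Big(\frac{\alpha}{t}\Big) \Big) \Big\}. \tag{5.15} \]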


The latter expression can be simplified by observing that

\[ u^*(x^*) = \inf_x \{ x x^* - u(x) \} = x^* (u')^{-1}(x^*) - u\big( (u')^{-1}(x^*) \big) \tag{5.16} \]

(i.e., the Legendre transform of $u$). By (4.2) we know that $u(x) = -\phi^*(-x)$, hence $u^*(x^*) = -\phi(x^*)$. Using the latter in (5.16) at the point $x^* = \alpha/t$, (5.15) reduces to the simple expression:

\[ \psi(t) = t\,\phi\Big(\frac{\alpha}{t}\Big). \tag{5.17} \]
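The conjugacy step behind (5.16)-(5.17) can be checked numerically. The sketch below assumes, for illustration only, the Kullback-Leibler kernel $\phi(t) = t\log t - t + 1$, whose associated utility through $u(x) = -\phi^*(-x)$ is $u(x) = 1 - e^{-x}$. The helper names and the search interval are hypothetical choices, not part of the paper.

```python
import math

def phi(t):
    """Assumed divergence kernel: phi(t) = t*log(t) - t + 1, for t > 0."""
    return t * math.log(t) - t + 1.0

def u(x):
    """Utility paired with phi via u(x) = -phi*(-x); here u(x) = 1 - exp(-x)."""
    return 1.0 - math.exp(-x)

def minimize_convex(f, lo, hi, iters=200):
    """Ternary search: minimum value of a convex function f on [lo, hi]."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) < f(m2):
            hi = m2
        else:
            lo = m1
    return f(0.5 * (lo + hi))

def concave_conjugate(x_star):
    """u*(x*) = inf_x { x * x_star - u(x) }, evaluated numerically."""
    return minimize_convex(lambda x: x * x_star - u(x), -30.0, 30.0)

def psi(t, alpha):
    """psi(t) = -t * u*(alpha / t), the quantity defined in (5.15)."""
    return -t * concave_conjugate(alpha / t)

if __name__ == "__main__":
    for s in (0.2, 0.5, 1.0, 2.0):
        print(s, concave_conjugate(s), -phi(s))
```

For each test point the two printed values should agree up to numerical error, which is the identity $u^*(x^*) = -\phi(x^*)$; `psi(t, alpha)` can likewise be compared with `t * phi(alpha / t)`.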


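It remains to compute

\[ \sup_{\bar\eta \in \mathbb{R}} \{ (1-F)\,u(\bar\eta) - \bar\eta(1-\alpha) \}. \tag{5.18} \]

Equating the derivative of the supremand to zero, we get the optimal $\eta^*$ from:

\[ u'(\eta^*) = \frac{1-\alpha}{1-F}, \]

which by (5.14) must satisfy $\alpha/F < u'(\eta^*)$, that is, $F > \alpha$; then substituting in (5.13), after some algebra, we finally get from (5.14):

\[ P_\phi(x) = \begin{cases} p \Big\{ F(g(x))\,\phi\Big( \dfrac{\alpha}{F(g(x))} \Big) + \big(1 - F(g(x))\big)\,\phi\Big( \dfrac{1-\alpha}{1-F(g(x))} \Big) \Big\} & \text{if } F(g(x)) > \alpha \\[2mm] 0 & \text{if } F(g(x)) \le \alpha, \text{ i.e. } \Pr\{g(x) \ge b\} \le \alpha. \end{cases} \tag{5.19} \]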
Note that the function $h(t) := t\,\phi\big(\frac{\alpha}{t}\big) + (1-t)\,\phi\big(\frac{1-\alpha}{1-t}\big)$, in terms of which $P_\phi$ is expressed, is convex and increasing for $0 < \alpha \le t < 1$, and $h(\alpha) = 0$; so by this and (5.19), $P_\phi(x) = p\,h(\max\{\alpha, F(g(x))\})$. It follows that if $F(g(x))$ is convex so is $P_\phi$. (Compare these results with [2].)
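The closed form (5.19) can be compared against a direct numerical evaluation of the double supremum in (5.12). The following is a minimal sketch, assuming the Kullback-Leibler kernel $\phi(t) = t\log t - t + 1$ with utility $u(x) = 1 - e^{-x}$ and a value $0 < F < 1$ of the distribution function. The function names and the truncation bound `z_max` on $y/p$ are hypothetical, chosen only for this illustration.

```python
import math

def phi(t):
    """Assumed divergence kernel: phi(t) = t*log(t) - t + 1, for t > 0."""
    return t * math.log(t) - t + 1.0

def u(x):
    """Matching utility u(x) = -phi*(-x) = 1 - exp(-x)."""
    return 1.0 - math.exp(-x)

def maximize_concave(f, lo, hi, iters=120):
    """Ternary search: maximum value of a concave function f on [lo, hi]."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) < f(m2):
            lo = m1
        else:
            hi = m2
    return f(0.5 * (lo + hi))

def penalty_closed_form(F, alpha, p):
    """Right-hand side of (5.19), with F standing for F(g(x))."""
    if F <= alpha:
        return 0.0
    return p * (F * phi(alpha / F) + (1.0 - F) * phi((1.0 - alpha) / (1.0 - F)))

def penalty_brute_force(F, alpha, p, z_max=20.0):
    """Nested suprema of (5.12); y is truncated to [0, z_max * p]."""
    def outer(y):
        hi_val = y * (1.0 - alpha) / p   # value of (y/p) g(x,b) when g(x) >= b
        lo_val = -y * alpha / p          # value of (y/p) g(x,b) when g(x) <  b
        inner = lambda eta: eta + F * u(hi_val - eta) + (1.0 - F) * u(lo_val - eta)
        return p * maximize_concave(inner, -z_max - 1.0, z_max + 1.0)
    return maximize_concave(outer, 0.0, z_max * p)

if __name__ == "__main__":
    for F, alpha, p in [(0.6, 0.3, 2.0), (0.9, 0.5, 1.0), (0.2, 0.3, 1.5)]:
        print(F, alpha, p, penalty_brute_force(F, alpha, p), penalty_closed_form(F, alpha, p))
```

With this particular $\phi$, $h(t)$ is the relative entropy between Bernoulli laws with parameters $\alpha$ and $t$, so the penalty is $p$ times that entropy whenever $F(g(x)) > \alpha$ and vanishes otherwise.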


In this section we treat the special case of the general problem (SP):

\[ \text{(SP-RHS)} \qquad \inf\{\, g_0(x) : g_i(x) \ge b_i,\ i = 1,\dots,m \,\} \]

which is obtained from (SP) by setting $g(x,b) := b - g(x)$.

The penalty function $P_\phi$ is given here by:

\[ P_\phi(x) = \sup_{y \ge 0} S_p\big( y^t (b - g(x)) \big). \]

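However, by the additivity of $S_p$, since $y^t g(x)$ is not random, we can write $P_\phi$ as:

\[ P_\phi(x) = \sup_{y \ge 0} \{ w(y) - y^t g(x) \}, \qquad w(y) := S_p(y^t b). \tag{6.1} \]
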
The corresponding deterministic primal $(DP_\phi\text{-RHS})$: $\inf\{ g_0(x) + P_\phi(x) \}$ is then:

\[ (DP_\phi\text{-RHS}) \qquad \inf_x\ \sup_{y \ge 0}\ K(x,y) \]

with

\[ K(x,y) := g_0(x) + w(y) - y^t g(x). \tag{6.2} \]

Assume now that $g_0(x)$ is convex and that $\{g_i(x)\}_{i=1}^m$ are concave functions, so (SP-RHS) is a convex program. Then $P_\phi(x)$ is also convex, and so $(DP_\phi\text{-RHS})$ is a convex program.

We define the dual problem corresponding to $(DP_\phi\text{-RHS})$ by:

\[ (DD_\phi\text{-RHS}): \qquad \sup_{y \ge 0}\ \inf_x\ K(x,y). \]

The main result of this section is a strong duality relation between the pair $(DP_\phi\text{-RHS})$ and $(DD_\phi\text{-RHS})$.

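Theorem 6.1 Let (SP-RHS) be a convex stochastic program and consider the corresponding deterministic program $(DP_\phi\text{-RHS})$.

If the following condition holds:

\[ \exists\, \hat{x} \in \mathbb{R}^n \text{ such that } g_i(\hat{x}) > \underline{b}_i \quad \forall\, i \in I, \tag{6.3} \]

where $\underline{b}_i$ denotes the infimum of the support of $b_i$, then:

\[ \inf\,(DP_\phi\text{-RHS}) = \max\,(DD_\phi\text{-RHS}). \tag{6.4} \]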

Proof: Since $g_0(x)$ is convex and $\{g_i(x)\}_{i=1}^m$ are concave, $K(\cdot,y)$ given in (6.2) is convex for every $y \ge 0$.

Now the function $w(\cdot)$ given in (6.1) can be rewritten as:

\[ w(y) = -\inf_{\eta \in \mathbb{R}} F(\eta,y), \]

where $F: \mathbb{R} \times \mathbb{R}^m \to \mathbb{R}$ is defined by $F(\eta,y) = -p\eta - p\,E\,u\big(\frac{y^t b}{p} - \eta\big)$; since $u$ is concave it follows from [19], Theorem 1 that $F(\eta,y)$ is (jointly) convex, hence $w(y)$ is concave and therefore $K(x,\cdot)$ is concave.

By a result in [16], a sufficient condition for the validity of (6.4) for a general convex-concave saddle function $K(x,y)$ is:

\[ \nexists\; y_0 \ge 0,\ y_0 \ne 0, \text{ such that } y_0^t\, \nabla_y K(x,y) \ge 0 \quad (x \in \mathbb{R}^n,\ y \ge 0). \tag{6.5} \]


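With a proof analogous to the one given in Theorem 5.1 (eqs. (5.4)-(5.5)), $w(y)$ is expressed as

\[ w(y) = p \Big\{ \eta(y) + E\,u\Big( \frac{y^t b}{p} - \eta(y) \Big) \Big\} \]

where $\eta(y)$ is obtained from $E\,u'\big( \frac{y^t b}{p} - \eta(y) \big) = 1$.

Thus $\nabla w(y) = E\big( b\,u'\big( \frac{y^t b}{p} - \eta(y) \big) \big)$, hence (6.5) is here:

\[ \nexists\; y_0 \ge 0,\ y_0 \ne 0, \text{ such that } y_0^t \Big\{ E\Big( b\,u'\Big( \frac{y^t b}{p} - \eta(y) \Big) \Big) - g(x) \Big\} \ge 0 \quad \forall\, x \in \mathbb{R}^n,\ y \ge 0. \]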


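The latter is certainly satisfied if:

\[ \exists\, \hat{x},\ \hat{y} \ge 0 \text{ such that } \nabla w(\hat{y}) = E\Big( b\,u'\Big( \frac{\hat{y}^t b}{p} - \eta(\hat{y}) \Big) \Big) < g(\hat{x}). \tag{6.6} \]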


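To show that condition (6.3) implies (6.6) it suffices to prove that:

\[ \inf_{y \ge 0} \frac{\partial}{\partial y_i} w(y) \le \underline{b}_i \quad \forall\, i = 1,\dots,m. \tag{6.7} \]

For all $i \in [1,m]$, let

\[ w_i(y_i) := w(0,\dots,y_i,\dots,0) = p \sup_{\eta \in \mathbb{R}} \Big\{ \eta + E\,u\Big( \frac{y_i b_i}{p} - \eta \Big) \Big\}. \]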


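Noting that

\[ \inf_{0 \le y \in \mathbb{R}^m} \frac{\partial}{\partial y_i} w(y) \le \inf_{y_i \ge 0} \frac{\partial}{\partial y_i} w_i(y_i) \quad \forall\, i, \]

to prove (6.7) it suffices to prove that

\[ \inf_{0 \le y_i} w_i'(y_i) \le \underline{b}_i. \tag{6.8} \]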

Now $w_i(y_i)$ is concave and $w_i(0) = 0$, hence by the gradient inequality

\[ 0 = w_i(0) \le w_i(y_i) - y_i\, w_i'(y_i) \]

and thus

\[ \lim_{y_i \to \infty} w_i'(y_i) \le \lim_{y_i \to \infty} \frac{w_i(y_i)}{y_i}. \tag{6.9} \]

But $w_i'$ is the derivative of a strictly concave function and thus is strictly decreasing, hence $\inf_{y_i \ge 0} w_i'(y_i) = \lim_{y_i \to \infty} w_i'(y_i)$.

Moreover, using (4.7) we have the relation:

\[ w_i(y_i) = y_i\, S_a(b_i) \]

with $a := p/y_i$ ($p > 0$); then from (6.9) and Lemma 4.3(a) we get the desired result (6.8).
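The limiting argument in (6.8)-(6.9) can be illustrated numerically. The sketch below assumes the exponential utility $u(x) = 1 - e^{-x}$, for which $w_i(y) = S_p(y b_i) = -p \log E\, e^{-y b_i / p}$ in closed form, and a discrete $b_i$ whose support and probabilities are hypothetical values chosen only for this example.

```python
import math

def w(y, values, probs, p):
    """w(y) = -p * log E exp(-y*b/p) for a discrete b (exponential utility assumed).

    The smallest support point m is factored out for numerical stability.
    """
    m = min(values)
    s = sum(q * math.exp(-y * (b - m) / p) for b, q in zip(values, probs))
    return y * m - p * math.log(s)

def w_prime(y, values, probs, p):
    """w'(y) = E[b * exp(-y*b/p)] / E[exp(-y*b/p)], an exponentially tilted mean."""
    m = min(values)
    weights = [q * math.exp(-y * (b - m) / p) for b, q in zip(values, probs)]
    return sum(wt * b for wt, b in zip(weights, values)) / sum(weights)

if __name__ == "__main__":
    values, probs, p = [1.0, 2.0, 4.0], [0.2, 0.5, 0.3], 1.0  # hypothetical data
    for y in (0.5, 1.0, 5.0, 50.0, 200.0):
        print(y, w_prime(y, values, probs, p), w(y, values, probs, p) / y)
```

Under these assumptions $w'(y)$ decreases in $y$, stays below $w(y)/y$, and both approach the smallest support point of $b_i$ as $y$ grows, which is the behaviour the proof relies on.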


The dual problem $(DD_\phi\text{-RHS})$ is given by

\[ \sup_{y \ge 0}\ \inf_x\ K(x,y). \]

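To get the full meaning of this dual we first prove:

Lemma 6.1: For every $y \ge 0$, $S_p(\inf_x L_b(x,y)) = \inf_x S_p(L_b(x,y))$, where here $L_b(x,y) = g_0(x) + y^t(b - g(x))$.

Proof:

\[ \begin{aligned} S_p\big( \inf_x L_b(x,y) \big) &= S_p\big( y^t b + \inf_x \{ g_0(x) - y^t g(x) \} \big) \\ &= S_p(y^t b) + \inf_x \{ g_0(x) - y^t g(x) \} && \text{[by Lemma 4.1(c)]} \\ &= \inf_x \{ S_p(y^t b) + g_0(x) - y^t g(x) \} \\ &= \inf_x \{ S_p( y^t b + g_0(x) - y^t g(x) ) \} && \text{[by Lemma 4.1(c)]} \end{aligned} \]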
Now $K(x,y)$ given in (6.2) can be written, using again the additivity property of $S_p$ (Lemma 4.1(c)), as:

\[ K(x,y) = S_p(L_b(x,y)). \]

Hence, by Lemma 6.1, the dual problem $(DD_\phi\text{-RHS})$ is

\[ (DD_\phi\text{-RHS}) \qquad \sup_{y \ge 0}\ S_p\big( \inf_x L_b(x,y) \big). \]

Thus we have shown that while in the deterministic case the Lagrangian dual of (SP) is the concave program $\sup_{y \ge 0} \inf_x L_b(x,y)$, in the stochastic case the dual program $(DD_\phi\text{-RHS})$ consists of maximizing the new certainty equivalent of the Lagrangian dual function.

This result generalizes, to arbitrary utilities, a result in [2, Theorem 4], proved for the exponential utility.
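The strong duality relation (6.4) can be observed on a toy instance. The sketch below is illustrative only: it assumes the exponential utility (so that $w(y) = -p\log E\,e^{-yb/p}$), a single constraint $x \ge b$ with objective $g_0(x) = x^2$, a hypothetical three-point distribution for $b$, and truncated search intervals for $x$ and $y$; none of these choices come from the paper.

```python
import math

VALUES = [1.0, 2.0, 4.0]   # hypothetical support of b
PROBS = [0.2, 0.5, 0.3]    # hypothetical probabilities
P = 1.0                    # penalty parameter p

def w(y):
    """w(y) = S_p(y*b) = -p * log E exp(-y*b/p) under the exponential utility."""
    m = min(VALUES)
    s = sum(q * math.exp(-y * (b - m) / P) for b, q in zip(VALUES, PROBS))
    return y * m - P * math.log(s)

def K(x, y):
    """Saddle function (6.2) with g_0(x) = x**2 and g(x) = x."""
    return x * x + w(y) - y * x

def ternary_max(f, lo, hi, iters=100):
    """Maximum value of a concave function on [lo, hi]."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) < f(m2):
            lo = m1
        else:
            hi = m2
    return f(0.5 * (lo + hi))

def ternary_min(f, lo, hi, iters=100):
    """Minimum value of a convex function on [lo, hi]."""
    return -ternary_max(lambda t: -f(t), lo, hi, iters)

def primal_value(y_max=200.0):
    """inf_x sup_{0 <= y <= y_max} K(x, y); y is truncated for the search."""
    penalised = lambda x: ternary_max(lambda y: K(x, y), 0.0, y_max)
    return ternary_min(penalised, 0.0, 5.0)

def dual_value(y_max=200.0):
    """sup_{0 <= y <= y_max} inf_x K(x, y)."""
    dual_fn = lambda y: ternary_min(lambda x: K(x, y), -100.0, 100.0)
    return ternary_max(dual_fn, 0.0, y_max)

if __name__ == "__main__":
    print("primal:", primal_value())
    print("dual:  ", dual_value())
```

Condition (6.3) holds in this instance because any $x > 1$ exceeds the smallest support point of $b$, so the two printed values are expected to coincide up to numerical error.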